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Abstract 

We discuss connecting tables with zero-one entries by a subset of a Markov basis. 
In this paper, as a Markov basis we consider the Graver basis, which corresponds to 
the unique minimal Markov basis for the Lawrence lifting of the original configura- 
tion. Since the Graver basis tends to be large, it is of interest to clarify conditions 
such that a subset of the Graver basis, in particular a minimal Markov basis it- 
self, connects tables with zero-one entries. We give some theoretical results on the 
connectivity of tables with zero-one entries. We also study some common models, 
where a minimal Markov basis for tables without the zero-one restriction does not 
connect tables with zero-one entries. 
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1 Introduction 



Markov bases methodology initiated by lDiaconis and Sturmfelsl |1998j ] for performing con 



ditional tests of discrete exponential family models have been extensively studied in recent 
years. The set of contingency tables sharing values of the sufficient statistic is called a 
fiber. A Markov basis guarantees connectivity of all fibers by definition. Since the size 
of a Markov basis tends to be large for large-scale problems, researchers are interested in 
a subset of Markov basis to connect specific fibers. In most applications of Markov basis 
there are no restrictions on the cell counts. However in some pro blems, the co unts are 
either zero or one. The most well-known case is the Rasch model (jRaschl |l980j |) used in 
educational statistics. 

The Rasch model can be interpreted as a logistic regression (logit model), where the 
number of trials is just one for each combination of covariates. In this model, tables 
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with zero-one entries (zero-one tables) are elements of a specific fiber, where the marginal 
frequencies corresponding to the response variable are all equal to one in the logistic 
regression. 

In other cases, zero-one tables appear as truncation or dichotomization of a variable, 
where for example only an occurrence or non-occurrence of certain large event is recorded. 
A convenient statistical model for zero-one tables is a log-linear model for contingency 
tables, where the support of the distribution is restricted to zero-one tables. Then we can 
use the Markov basis methodology for conditional tests of the fit of the model. 

Two-way zero-one tables with structural zeros arise in many practical problems in 
ecological studies and social networks and exact tests of quasi-i ndependence mo d els via a 



Marko v basis has been studied for some specific problems (e.g. iRao et al.l [19961 ] . iRoberts 
200d |). 



Another source of zero-one tables is the set of incidence matrices satisfying certain 
combinatorial restrictions. For example, the set of Latin squares and Sudoku tables 
can be considered as a set of zero-one tables with fixed marginals. From combinatorial 
viewpoint it is of interest to construct a connected Markov chain over the set of these 
tables. 

Note that a minimal Markov basis without the zero-one restriction may not connect 
zero-one tables, because by applying a move from the Markov basis, some cells may contain 
frequencies greater than one. On the other hand, as clarified in Proposition 12. II in Section 
HI the set of square-free moves of the Graver basis connects tables with zero-one entries. 
Therefore it is of interest to study when a minimal Markov basis connects zero-one tables, 
and if this is not the case, to find a subset of the Graver basis connecting zero-one tables. 

In this paper we give some theoretical results on the connectivity of tables with zero- 
one entries. Unfortunately we found that our sufficient conditions for connectivity are 
satisfied only in a few examples. Therefore we investigate some common models, where a 
minimal Markov basis for tables without the zero-one restriction does not connect tables 
with zero-one entries. 

The organization of the paper is as follows. For the rest of this section we summarize 
our notation and preliminary facts. In Section [2] we give some theoretical results on 
connectivity of zero-one tables with a minimal Markov basis and with some other subsets 
of the Graver basis. In Section [3] we study connectivity of zero-one tables in some common 
models for contingency tables, including the Rasch model, its multivariate version and 
the quasi-independence model. We also discuss Latin squares. We conclude the paper 
with some remarks in Section HI 

1.1 Notation and preliminary facts 

Here we set up our notation and sum marize preliminary facts on Markov and Graver 



bases. We mostly follow the notation in lHara et al.l [2007l |. Let X denote the set of cells 



of a table, where i G X is usually a multi-index. Let A be the set of variables. Then a cell 
i is considered as a |A| dimensional vector i := (id)d£A- Denote by / = |X| the number of 
cells. For a subset D C A, denote by in := {id)d&D and Xp a D- marginal cell and the set 
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of _D-marginal cells, respectively. Define Zd := YideD-^d, and Ip := YideD ^d- 

A contingency table or a frequency vector is denoted by x = (#(«)) { ie x}. Let x(id) 
denote a marginal frequency, i.e. x{i D ) = £\ ceJ x{io,iD c )- For a given a;, supp(x) = 
{i | x(z) > 0} denotes the set of positive cells of x. Given a loglinear model (more precisely 
a toric model), the sufficient statistic t can be written as t = Ax for some integral matrix 
A. We call A a configuration of the model. I a denotes the toric ideal of A. Assume that 
I a is homogeneous, i.e. there exists a vector w such that 

w'A = (!,...,!) (1-1) 



(Lemma 4.14 in Sturmfels 19961 ] ) . The set Tt = {x > | t = Ax} of contingency tables 



with the common sufficient statistic t is called a fiber. 

An integer vector z is called a move if Az = 0. \z\ = denotes the Li-norm 

of z. Separating positive elements and negative elements of z, we write z = z + — z~~, 
where z + is the positive part of z and z~ is the negative part of z. The total sum of 
frequencies in z + (z~~) is called degree of z. For two moves z±, Z2, the sum Z\ + Z2 is called 
conformal if there is no cancellation of signs in z\ + z 2 > i-e., = supp(z^) fl supp(2^) = 
supp(zf ) H supp(;z^). A move z which can not be written as a conformal sum of two 
(non-zero) moves is called primitive. The set of primitive moves is finite and it is called 
the Graver basis of I a- We denote the Graver basis as £>gr- 

Let Ej denote the I x I identity matrix. The configuration 

HA) = y (1.2) 

is called the Lawrence lifting of A. In statistical terms, the Lawrence lifting corresponds 
to the logistic re gression, where t he interaction effects of the covariates are specified by 



A. It is known ( [Sturmfelsl . Il996l . Theorem 7.1]) that the unique minimal Markov basis 
of Ia(A) coincides with the Graver basis of Ia- 



A finite set of moves B is distance reducing ( Takemura and Aoki 20051 ] ) if for all t and 



for all x, y G Tt there exists an element z G B and e = ±1 such that 

x + ez G Tt-, \x + ez — y\ < \x — y\ or y + ez £ jF t , \x — (y + ez)\ < \x — y\. 

If B is distance reducing, it is obviously a Markov basis and we call B a distance reducing 
Markov basis. Furthermore B is strongly distance reducing if for all t and for all x, y G T± 
there exist elements Z\,z-i G B and ei,62 = ±1 such that x + e\Z\,y + e2Z 2 G Tt-, 
\x + e x z\ — y\ < \x — y\ and \x — (y + e 2 z 2 )\ < \x — y\. 

Since we are considering zero-one tables in this paper, let us denote 

f t = {x | t = Ax, x{i) = or 1}. (1.3) 

As in the usual setting for Markov bases, we call a finite set B of moves a Markov basis 
for zero-one tables, if B connects all fibers Tf If B is distance reducing for zero-one 
tables, then it is a distance reducing Markov basis for zero-one tables. Since there are 2l x ' 
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zero-one tables, there are only finitely many fibers and finitely many differences of two 
elements belonging to the same fiber. Therefore the set of these differences is the largest 
trivial Markov basis. However this set is clearly too large and we are interested in a much 
smaller set of moves connecting all fibers Tf 

2 Some theoretical results 

The starting point of our investigation of connectivity of zero-one tables is the following 
basic fact on the Graver basis #gr for I a- 

Proposition 2.1. Let Bq denote the set of square- free moves of the Graver basis Bqr of 
1a- Then Bq is strongly distance reducing for tables with zero-one entries. 

Proof. Let x, y be two zero-one tables of the same fiber. They are connected by a 
conformal sum of primitive moves 

y = x + zi H + z K . (2.1) 

Since there is no cancellation of signs on the right-hand side, once an entry greater than or 
equal to 2 appears in an intermediate sum of the right-hand side, it can not be canceled. 
Therefore it follows that x + Zi + ■ • • + z k G F t for k = 1, . . . , K and Zi, . . . , z K G Bq. 
Since there are no sign cancellations in (12.1 1) . z±, . . . , z^ can be added to x in any order 
and —Zi, . . . , — zk can be added to y in any order. Therefore Bq is strongly distance 
reducing. □ 

Since the Graver basis tends to be large, we are interested in conditions for connecting 
tables with zero-one entries with a subset of the Graver basis. We consider the following 
condition. 

Condition 2.1 (Existence of strong crossing pattern). Let ej denote the frequency vector 
with just 1 frequency in the i-th cell and otherwise. For every fiber and every x,y,x^ y, 
in the same fiber, there exist distinct cells ii,i2,i3,ii such that x(i\) > y(ii),x(i2) > 
y(i2),x(i 3 ) < y(i 3 ),x(U) < y{u) ory{h) > x(ii),y{i 2 ) > x(i 2 ),y(i 3 ) < x(i 3 ),y(u) < x{i A ) 
and 

Z 6j 3 -|- ej 4 (Bj 2 (2-2) 

is a move. 

Note the set B of the moves z in (12. 2p forms a distance reducing Markov basis. There- 
fore Condition 12.11 is a sufficient condition for existence of a distance reducing Markov 
basis consisting of square-free moves of degree two. However existence of such a Markov 
basis does not imply Condition 12. 1[ We discuss this point at the end of this section. 
Under Condition 12.11 we have the following result. 

Theorem 2.1. Under Condition \2.1\ the set B of moves 112 . £j) is distance reducing for 
tables with zero-one entries. 
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Proof. Let x, y be two zero-one tables in the same fiber. By Condition 12.11 we can find 
distinct cells i\, i 2 , 13, 14 such that 

> y(h) + 1, x(i 2 ) > y{i 2 ) + 1 

x{i\) = 1, x(i 2 ) = 1, y(ii) = 0,y(i 2 ) = 

and < x{iz) < yiiz) < 1 =>- ^(^3) = 0, 2/(^3) = 1. If = then we can add 

z = e.j 3 + ej 4 — — e.j 2 to x and reduce the Li-distance by four. Furthermore x + z is a 
table of zeros and ones. 

It remains to consider the case x{i±) = 1. Since x{i<±) < 2/(^4), we have = 1. 

Therefore = 0,y(i 2 ) = 0,y(z 3 ) = = 1. Then we can subtract z from ?/ and 

y — z is a table of zeros and ones. Furthermore \x — (y — z)\ = \x — y\— 2. Therefore under 
Condition 12.11 we can reduce the distance always by at least 2. Therefore B is distance 
reducing for fibers of zero-one tables. □ 

Theorem 12.11 is simple and effective to prove that a particular Markov basis connects 
zero-one tables for some simple configurations. We now present several generalizations of 
Theorem 12 . 11 The following proposition is an obvious extension of Theorem 12.11 and we 
omit a proof. 

Proposition 2.2. Assume that there exists a positive integer M, such that for every fiber 
and every x,y, x ^ y, in the same fiber there exists a positive integer m < M and distinct 
cells %im such that 

2m m 

z = E e %--5>; ( 2 - 3 ) 

j=m+l j=l 

is a move such that at least one of the following conditions hold: i) x(ij) > y{ij), j = 
l,...,m, x(ij) < y(ij), j = m + 1, . . . , 2m - I, x(i 2m ) < y(i 2m ), or ii) y(ij) > x(ij), 
j = 1, . . . , m, y{ij) < x{ij), j — m + 1, . . . , 2m — 1, y{i2 m ) < x(i 2m )- Then the set B of 
moves z in A 2. Sty is distance reducing for tables with zero-one entries. 

Proposition 12.21 suggests a possibility to choose a subset of Bo of Proposition ^. 11 which 
still guarantees the connectivity of tables with zero-one entries. Let B be a subset of Bq 
with the following property. 

Condition 2.2 (Generalized strong crossing pattern for the Graver basis). For every 
element z = z + — z~ £ B \B, there exists a move z' = Y^j= m +\ e ^ — Sj=i e i 3 ^ & such 
that ii, . . . ,i 2m are distinct and at least one of the following conditions hold: i) z + (ij) > 
z~(ij), j = l,...,m, z + (ij) < z~(ij), j = m+ 1, ...,2m- I, z + (i 2m ) < z~(i 2m ), or ii) 
z~(ij) > z + (ij), j = 1, . . . ,m ; z~(ij) < z + (ij), j = m+1, . . . ,2m-l, z~(i 2m ) < z + (i 2m ). 

Combining Proposition 12.11 and Proposition 12.21 we have the following proposition. 

Proposition 2.3. If B satisfies Condition \2.2\ then B is distance reducing for tables with 
zero-one entries. 
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Proof. As in the proof of Proposition 12.11 consider (12. ip . where x, y are two zero-one 
tables in the same fiber. By induction on the number K of primitive moves, it suffices to 
prove the distance reducti on for y — x + Z\. By the same argument as in Lemma 2.4 of 



Takemura and Aokil |2004j , it suffices to check the distance reduction by moves from B in 
moving from z^ to z\ . If Z\ G B, we can reduce the distance at once. If Z\ G Bq \ B, we 
can find z G B which can be applied either to zjf or zf such that \zi\ is reduced. The 
resulting move can now be decomposed into a conformal sum of primitive moves and we 
can recursively use the distance reduction argument. This proves the proposition. □ 

By Proposition 12.31 once Bq is given we can remove some elements from Bq and obtain 
a smaller set of moves B as follows. Find a pair z, z G Bo, z ^ z, such that z + z has just 
one sign cancellation, i.e. there is only one cell i such that z(i)z(i) = —1. If z + z G Bo 
then we can remove z + z from Bq and still guarantee connectivity of zero-one tables. 

As the last topic of this section we clarify the interpretation of Condition 12.11 by 
discussing a weaker condition which is equivalent to the existence of distance reducing 
Markov basis consis t ing o f square- free moves of degree two. In our previous works (e.g. 



Aoki and Takemura! 20051 ]. lHara et al.l 20071 ]) we have obtained such Markov bases and 



used a similar argument as in the proof of Theorem 12.11 By omitting the requirement 
< y{h) m Condition 12.11 consider the following weaker condition: 

Condition 2.3 (Existence of weak crossing pattern). For every fiber and every x, y, x ^ 
y, in the same fiber, there exist distinct cells i±, i 2 ,i 3 ,h such that x(i\) > y(ii),x(i 2 ) > 
y(i 2 ),x(i 3 ) < y(i 3 ) ory(ii) > x(ii),y(i 2 ) > x{i 2 ),y{h) < x(i 3 ) and z = r:,,, • a,, (>■, c, 
is a move. 

We now show that Condition 12.31 is equivalent to the existence of a distance reducing 
Markov basis consisting of square-free moves of degree two. 

Proposition 2.4. There exists a distance reducing Markov basis consisting of square-free 
moves of degree two if and only if Condition \2.3\ holds. 

Proof. It is clear that under Condition 12.31 the set B of moves in (12.21) is distance reducing. 
Therefore it suffices to show the converse. Let B be a distance reducing Markov basis 
consisting of square-free moves of degree two. Let x,y, x ^ y, be in the same fiber. We 
can find ±z G B such that z is applicable to x or y and \(x + z) — y\ < \x — y\ or 
| x — ( y + z) | < \x — y\, respectively. For x, y in the same fiber and for distinct indices 
ii,i2,h,U let 

g(k, hi y) = I(x(h) > y(h)) + I(x(h) > y(h)) 

+ I{x{i 3 ) < y(i 3 )) + I{x{i 4 ) < y{h)) - 2, 

where 1(E) denotes the indicator function of the event E. When z can be added to x, 
we have 

\x-y\ - \(x + z)-y\ = 2g(i 1 ,i 2 ,i 3 ,U,x,y). 
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Therefore 

|aj-i/|-|(aj + *)-y|>0 o g(n, i 2 , i 3 , k, x, y) > 0, 

i.e., at least 3 inequalities among x{i\) > y(ii), x(i 2 ) > y(i2),x(i 3 ) < y(iz), x(i 4 ) < t/(z 4 ) 
hold. It is easy to see that then Condition 12.31 holds. Similarly if z can be added to y 
and \x — (y + z)\ < \x — y\, then Condition 12.31 holds. □ 



3 Connectivity results for some models 

In this section we investigate connectivity of zero-one tables for some common models for 
contingency tables. 



3.1 Rasch model 

Rasch model (Rasch 1980j ] ) has long received much attention in the item response theory. 
Suppose that / persons take a test with J dichotomous questions. Let G {0, 1} be 
a response to the jth question of the zth person. Hence the I x J table x = (xij) is 
considered as a two-way contingency table with zero-one entries. Assume that each Xij is 
independent. Then the Rasch model is expressed as 



exp(cti - (5j) 
1 + exp(aj — (3 



(3.1) 



where on is an individual's latent ability parameter and fij is an item's difficulty parameter. 

]j =1 x^ and column sums x + j = J2i=i 



the sufficient 



Then the set of row sums x 
statistic for ai and 

The Rasch model has been extensively studied and practically used for evaluating 
educ ational and psycho l ogica l tests. Many inference procedures have been developed 



(e.g. 

as 



Glas and Verhelstl [1995^ and most of them rely on asymptotic theory. However, 



Raschl [19801 ] pointed out, a sufficiently large sample size is not necessarily expected in 
pr actice. I n suc h cases the asymptotic inference may be inappropriate 



Rasch 1980| proposed to use an exact test procedure. As mentioned in Rasch 1980 



the conditional distribution of zero-one tables given person scores and item totals is easily 
shown to be uniform. In order to implement exact test for Rasch model via Markov basis 
technique, we need a set of moves which conn ects every fiber of two-way zero-one tables 
with fixed row and column sums. iRyserl 19571 1 first showed that the set of basic moves 



J 



i i 
1 -1 
■1 1 



in two-way complete independence model connects any fiber of zero-one tables with 
fixed row and column sums. Since then, many Monte Carlo procedures via Markov 
basis technique to compute distribution of test statistics to test the goodness-of-fit of 
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the Rasch model have b een proposed (e.g. iBesag and Clifford! |l989j . iPonocnyl [2001^ 
Cobb and Chen] [2003] I IChen and Snml3|2005l provided a computationally more effi- 
cient Monte Carlo procedure for implementing exact tests by using sequential importance 
sampling. 

In the framework of the present paper, the connectivity by basic moves is a conse- 
quence of Theorem 12.11 and Proposition 12 . 11 The Rasch model can be regarded as the 
Lawrence lifting of the independence model for / x J tables. Assume that ii,i2, ■ ■ - ,i r 
and ji,j2, ■ ■ ■ ,j r are all distinct. Denote i[ r ] = (ii, . . . ,i r ), j[ r ] = (ji, . . . ,j r ). Then a loop 
of degree r 

«r(*[r]5 J[r]) = {%}, 1 < h, ■ ■ ■ , V < !■» 1 < Jl, - • -,jr < J, 

is defined by a move such that 



'1232 



"«2J3 



-l3r-l 
-ijr ~ 



= 1, 
-1, 



and all the other elements are zero (e.g. lAoki and Takemural 2005]). A loop of degree r 
is written as 



Ji J2 •■■ 



Jr-l Jr 



h 


1 


-1 


















1 


-1 










ir-l 












1 


-1 




-1 
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From p. 382 of iDiaconis and Sturmfelsl |1998| we know that the set of loops of degree 
r, r < min(J, J) forms the Graver basis for the complete independence model of I x J 
contingency tables. Since the set of basic moves satisfies Condition 12. 1[ where x is the 
positive part and y is the negative part of these loops, it follows that the set of basic 
moves connects I x J zero-one tables with fixed row and column sums. 



3.2 Many-facet Rasch model 



The many-facet Rasch mo del is an exten si on of the Ras ch model to multiple items and 
polytomous responses (e.g. iLinacrd 1989] . iLinacrd 1994] ) and has also been extensively 
used in practice f or evaluating ess a y exams and s coring systems of judged sports (e.g. 



Yamaguchil |1999] . IZhu et all |1998] . iBasturkl |2008j ). 



Suppose that I\ articles are rated by li reviewers from J 3 aspects on the grade of J 4 
scales from to J 4 — 1. x ili2 i 3i4 = 1 if the reviewer i 2 rates the article %\ as the z 4 th grade 



0. 



from the aspect i 3 and otherwise x ili2 i 3i4 
zero-one table. We note that x satisfies x 
Then the three-facet Rasch model for x is expressed by 



Then x - 



{ x ii 
X 



u } is an 7 X x I 2 x I 3 x I 4 
= 1 for all ii, ii and 



n*2«3«4 



P(Xi 



«1«2«3*4 



i: 



exp ~ Pit ~ ~ A 4 ] 

Ej=d ex P [«4(Ai - Pi 2 - A 3 ) - A. 



(3.2) 



s 



In general, the V-iacet Rasch model is defined as follows. Let x = {x(i)}, i := . . . , iv+x) 
be an Ji x • • • x Iy+i zero-one table. Assume that ly+i — {0, • • • , Iv+i ~ 1} an d that x 
satisfies 

Iv+i-l 

x(i{i,...,v}) ■= x ( < ) = L 

i v +l=0 

Then the l/-facet Rasch model is expressed as 

/.x -. \ eX P - A 2 - • • • - Ay) - Piv+i] o \ 

P(x(z) = 1) = — r — ^ : — -. (3.3) 

E»vS=o ex P ^v+i(Ai - A 2 - • • • - fry) - /WJ 

When y = 2, J 3 = 2 and /3; 3 = const for i 3 G {0,1}, the model coincides with the Rasch 
model (dUD. Define t° by 



Iv+i-l 

t° = \ ^ iv+l ' X (Hv,V+l}) 
i v+1 =0 



i{v,V+l} G T{ v y + i}, V — 1, . . . , V 



Then the sufficient statistic t is written by 

t = t° U {x(i v +i) | iv+i G Xy+i}. 
When /3j v+1 is constant for iy+i G Xy +1 , t is written by 

t = t°U{x + }, 

where x + := Eiex x (^)' I n ^ ne case °^ ^ ne three-facet Rasch model (13.21) . t is expressed 
as follows, 

h h 

H G Xi, 24X-|_j 2 +j 4 , ?2 G X2, 

.14=0 44=0 

/4 l 

23 G X 3 , x +++i4 , i 4 G X 4 >. 
j 4 =o J 

In order to implement exact tests for the many-facet Rasch model, we need a set of 
moves which connects any fiber jF t of zero-one tables. In general, however, it is not easy 
to derive such a set of moves. As seen in the previous section, in the case of the Rasch 
model (13. ip . the set of basic moves for two-way complete independence model connects any 
fiber. For the many-facet Rasch model (13.31) . however, the basic moves do not necessarily 
connect all fibers. Consider the case where V = 3 and J4 = 2. In this case, t° is written 
as 

t = {xi 1++ i, £_|_i 2 +i, £-|_+i 3i I i v G T v , v = 1, 2, 3}. 

Since x(z 4 ) = £\ 6Z x{i^ v ^) for v = 1,2,3, t° is the sufficient statistic. t° is equivalent 
to the sufficient statistics of three-way complete independence model for (z 4 = l)-slice of 
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x. From Proposition 12. 1\ the set of square-free moves of the Graver basis for three-way 
complete independence model connects any fiber jF t . Table [1] shows the number of square- 
free moves of the Graver bas is for I\ x I 2 x I 3 three-way complete independence model 
computed via 4ti2 (14ti2 teaml ). We see that when the number of levels is greater than two, 
the sets include moves with degree greater than two. This fact does not necessarily imply 
that higher degree moves are required to connect every fiber for the three-way complete 
independence model. However we can give an example which shows that the degree two 
moves do not connect all fibers of the three-way complete independence model. 



Table 1: The number of square-free moves of the Graver basis for three-way complete 
independence model 

degree of moves 



h X J 2 X I 3 


2 


3 


4 


5 


6 


2x2x2 


12 














2x2x3 


33 


48 











2x2x4 


64 


192 


96 








2x2x5 


105 


480 


480 








2x3x3 


90 


480 


396 








2x3x4 


174 


1632 


5436 


1152 





2x3x5 


285 


3840 


23220 


33120 


720 


3x3x3 


243 


3438 


19008 


12312 






Example 3.1 (A fiber for 3x3x3 three-way complete independence model). Consider the 
following two zero-one tables x and y in the same fiber of three-way complete independence 
model. 

k 





12 3 


1 

x:- . 2 
3 



1 
1 




i = 1 
k 

1 2 3 


1 

3 




1 1 


i = 1 
of the two tables is 


z = 



1 
0-10 








1 





-1 

























1 








1 





1 


1 









-1 











1 
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and we can easily check that z is a move for the thre e-way comp l ete in dependence model. 

Let A be the set of degenerate variables defined in lHara et all \200Ti . Then degree two 
moves for three-way complete independence model are classified into the following four 
patterns. 



1. A = {1,2,3} : 







*3 






i's 




1 







-1 





i> 2 





-1 


L 2 





1 






k 






k 



2. A = {1,2} : 



3. A — {1,3} 



I A = {2,3} 



ii 





k 


i's 




i-3 


i's 




1 





i2 





-1 


i> 2 


-1 





i> 2 





1 






k 






k 




k 






^3 




k 


1 


-1 


12 


-1 


1 










l 2 












k 






k 




k 










k 


1 


-1 








i' 2 


-1 


1 









However it is easy to check that if we apply any move in this class to x or y, —1 or 2 
has to appear. Therefore we cannot apply any degree two moves to both x and y. Hence 
a degree three move is required to connect this fiber. 

This example indicates that it may be difficult to obtain a set of moves which connects 
every fiber of the many-facet Rasch model theoretically. As seen in Table [H the number 
of square-free moves in the Graver basis is too large even for three-way tables. When 
the number of cells is greater than 100, it seems to be difficult to compute the Graver 
basis via 4ti2 in a practical length of time. Hence implementations of exact tests by using 
the Graver basis is limited to very small models at this point. The clarification of the 
structure of the set of moves which connects all zero-one fibers for more general many- 
facet Rasch model is important to implement exact tests. However this problem seems to 
be difficult at this point and is left CIS db future task. 



3.3 Two-way zero-one tables with structural zeros 

Two-way zero-one tables with structural zeros arise in many practical problems, including 
ecological studies and social networks. Let x = {xij} be an Jx J zero-one table and denote 
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by S C X the set of cells tha t are not structural zeros. We consider the quasi-independence 
model ( Bishop et al. 1975| ) CIS db null hypothesis, 



log P(Xi 

P(x. 



0. 



a,- 



A- 



] 1 



(hj)eS 
otherwise. 



(3.4) 



The sufficient statistic t for the models is the set of row and column sums. Denote by 
B(S) the set of moves for the quasi-independence model (13. 1J) . Then is written by 

S(S') = {z = {zij} | 2 i+ = z +j = 0, Zjj = for e S c }. 

We denote a structu ral zero cell by [0] to distinguish it from a sampling zero cell. 



Rao et al.1 [1996l | discussed the connectivity of zero-one tables in the case where I = J 
and all the diagonal elem ents are structural zeros and provided a Markov basis for zero- 
one tables. iRobertsI 2000| applied the results to analyses of social networks and proposed 
an efficient implementation of exact tests of quasi-independent model (13.41) via MCMC. 
Chen! 20071 ] proposed a procedure for implementing exact tests via sequential importance 
sampling for general two-way zero-one tables with structural zeros. In this section we ex- 



tend the argument of iRao et al.l [19961 ] to general two-way zero-one tables with structural 
zeros and provide a Markov basis for zero-o ne tables in the quasi - indep endence model. 

For general two-way contingency tables, lAoki and Takemural 20051 ] provided a com- 
plete description of the unique minimal Markov basis for the quasi-independence model 
(13. 4p . A loop z r (i[ r y, j[ r ]) is defined as in Section |3~T1 When z r (i[ r y, j[ r ]) is a move in B(S), 
z r (i[ r y j[ r ]) is called a loop on S. 

Definition 3.1 ( Aoki and Takemura 20051 ]). A loop z r (i\ r y, j\ r ] 
I(i[ r yj[ r ]) does not contain support of any loop on S of degree 2, 



on S is called df 1 if 
. , r — 1, where 



I ( i [ry 3[r\) = {(hj) I i e {h, . . .,ir},j e {ji, . . .,>}}■ 

z r (i\ r y j\ r ]) is df 1 if and only ifT(i\ r y jw) contains exactly two elements in S in every row 
and column. 

The following integer arrays are examples of df 1 loops of degree two, three and four 
on some S. 
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+1 








[0] 


+1 


-1 
























+1 


-1 


[0] 


[0] 





-1 


[0] 


+1 


[0] 





[0] 


+1 


[0] 


-1 





[0] 


[0] 


-1 


+1 






(3.5) 



We note that a degree 2 loop Zz(ii, H'i3\i32) is a basic move. 

Denote by B m (S) the set of df 1 loops of degree 2, . . . , min{/, J}. For general con- 
tingency tables, Aoki and Takemura 20051 ] showed that Bdn(S) forms two-way unique 
minimal Markov basis for the quas i-independence model (13 .4p . By following the argu- 



ment in lAoki and Takemural 20051 ] . however, we can also prove that £>dfi(S) connects 



every fiber Tt of zero-one tables. 
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Theorem 3.1. The set of df 1 loops of degree 2, . . . , min{/, J} connects every fiber Tt of 
zero-one tables of the quasi-independence model flff.^p . 



The proof is in the same way as the proof of Theorem 1 in lAoki and Takemural [2005 
and omitted here. 



As discussed in Section 5 in lAoki and Takemural [2005| , in the case of square tables 



with diagonal elements being structural zeros, B^n (S) contains b asic moves and df 1 loops 
of degree 3 which coincides with the results of iRao et al.l 1996 . 



3.4 Latin squares and zero-one tables for no-three- fact or- interact ion 
models 

Zero-one tables also appear quite often in the form of incidence matrices for combinatorial 
problems. Here as an example we consider Latin squares. A Latin square is an n x n 
table filled with n different symbols in such a way that each symbol occurs exactly once 
in each row and column. A 3 x 3 Latin square is written by 



(3.6) 



When the symbols of an n x n Latin square are considered as coordinates of the third 
axis (sometimes called the orthogonal array representation of a Latin square), it is a 
particular element of a fiber for the n x n x n no-three-factor-interaction model with all 
two-dimensional marginals (line sums) equal to 1. For example, the 3x3 Latin square 
( 13. 6|) is considered as a 3 x 3 x 3 zero-one 




«2 




«2 
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x = i\ 
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' ^l 


1 







1 







1 










1 




1 





H = 


1 


^3 = 


2 


*3 = 


3 



(3.7) 



with Xj 1 



82" 



1, x + 



12*3 



X 



1 for all i\, %2 and One of the reasons 



to consider a Markov b asis for Latin squares is to generate a Latin square randomly. 
Fisher and Yates 1934] advocated to choose a Latin square randomly from the set of 



Jacobson and Matthews! [19961 ] gave a Markov basis for the set of n x n 



Latin squares. 
Latin squares. 

Because the set of Latin squares is just a particular fiber, it may be the case that a 
minimal set of moves connecting all Latin squares is smaller to the set of moves connecting 
all zero-one tables. This is indeed the case as we show for the simple case of n = 3. We 
first present a connectivity result for 3x3x3 zero-one tables with all line sums fixed. 
Let z = tzj,jk}i,j,k=-\ .2 a be a move for 3x3x3 no-thee-factor-interaction model. From 



Diaconis and Sturmfels 1998 and Aoki and Takemura 2003 the minimal Markov basis 
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consists of basic moves such as 
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z = 


-1 
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-1 











































(3.8) 



and degree 6 moves such as 
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-1 





z = 
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-1 































(3.9) 



However these moves do not connect zero-one tables of the 3x3x3 no-three-factor- 
interaction model. We need the following type of degree 9 move, which corresponds to 
the difference of two Latin squares. 
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z = 
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-1 
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-1 



(3.10) 



Proposition 3.1. The set of basic moves Ii3. 8\) . degree 6 moves Ii3. 9\) and degree 9 moves 
li3.1U\) forms a Markov basis for 3x3x3 zero-one tables for the no-three-factor-interaction 
model. 

Proof. Consider any line sum, such as = z + n = z m + z 211 + 2 3 n of a move z. If 
(^111,^211,^311) 7^ (0,0,0), then we easily see t hat ( zm, z 2 ii, Z311} = {-1,0,1}. By a 



similar consideration as in lAoki and Takemural [2003| , each i- or j- or /c-slice is either a 



loop of degree two or loop of degree three, such as 
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or 
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-1 
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(3.11) 



Now we consider two cases: 1) there exists a slice with a loop of degree two, or 2) all 
slices are loops of degree three. 

Case 1. Without loss of generality, we can assume that the (i = l)-slice of z is the loop 
of degree two in ( 13.111) . Then we can further assume that 2:211 = — 1 and 2311 = 0. Now 
suppose that 2222 = — 1- If -2212 = 1 or 2221 = 1, then this constitutes a strong crossing 
pattern of Condition 12.11 and we can reduce \z\ by a basic move. This implies 2212 = 
2:221 = 0. But then 2213 = 2:223 = 1 and this contradicts the pattern of {2213, -2223, -2233} = 
{-1,0,1}. 

By the above consideration we have 2:222 = and 2:322 = — 1- By a similar consideration 
for the cells 2ji2 an d z^x, i = 1, 2, 3, we easily see that z is of the form 
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which is a degree 6 move. 

Case 2. It is easily seen that the only case where degree 6 moves can not be applied is 
of the form of the move of degree 9 in (13.101) . This proves that connectivity is guaranteed 
if we add degree 9 moves. 

We also want to show that degree 9 moves are needed for connectivity. Consider 
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X = 
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By a simple program it is easily checked that if we apply any basic move or any move 
of degree 6 to x, —1 or 2 has to appear. Hence degree 9 moves are required to connect 
zero-one tables. 

□ 



Now consider 3x3 Latin squares (13.71). It is well-known that ther e is only one isotopy 
class of 3 x 3 Latin squares (Chapter III of IColbourn and Dinitzl |2007j |). i.e., all 3 x 3 Latin 
squares are connected by the action of the direct product £3 x S 3 x S3 of the symmetric 
group S3 which is generated by transpositions, and a transposition corresponds to a move 
of degree 6 in (13. 9p . Therefore, the set of 3 x 3 Latin squares in the orthogonal array 
representation is connected by the set of moves of degree 6 in Ii3.9\) . In view of Proposition 
13. 1| we see that we do not need basic moves nor degree 9 moves for connecting 3x3 Latin 
squares. 



Th ere are two isotopy classes for 4x4 Latin squares (1.18 of III. 1.3 of lColbourn and Dinitz 
2007| ) and representative elements of these two classes are connected by a basic move. 



Transposition of two levels for a factor corresponds to a degree 8 move of the following 
form. 
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Therefore the set of 4 x 4 Latin squares is connected by the set of basic moves and moves 
of degree 8 of the above form. We can apply a similar con sideration to the celebr ated 
result of 22 isotopy classes of 6 x 6 Latin squares derived by iFisher and Yates! 1934 . 



4 Concluding remarks 

In this paper we discussed Markov bases for tables with zero-one entries. We derived 
several general results, where a particular subset of the Graver basis connects zero-one 
tables. However, in general, we found that a Markov basis for zero-one tables is difficult 
and requires separate arguments for each model. We obtained Markov bases for zero-one 
tables for some common models of contingency tables. 
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Rapallo and Yoshidal 20091 ] gave some results for contingency tables with bounded 
entries, in particular for the case of two-way tables with structural zeros. A zero-one 
table is a particular case of contingency tables with bounded entries. If the bound is large 
enough, compared to the sample size of a particular fiber, it seems that the bound is not 
binding. In this sense the bound of 1 in our case seems to be most stringent. On the 
other hand, our proof of Proposition 13.11 suggests that a Markov basis for zero-one tables 
may have a simple structure. It is an interesting problem to describe how Markov bases 
behave as we vary the upper bound for the cells. 

In Section 13.41 we considered Latin squares. It is of interest to consider other combi- 
na torial designs, such as t he Su doku. A Markov basis for Sudoku designs is c onsidered 



in 



Fontana and Rogantinl [20091 ] and their invariance structure is discussed in ISei et al. 



20091 ] . It is a challenging problem to derive a Markov basis for the ordinary 3x3x3x3 
Sudoku. 
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